K | # of bigrams | # of trigrams | # of 4-grams | # of 5-grams | # of 6-grams |
---|---|---|---|---|---|
100 | 67 | 91 | 99 | 99 | 99 |
1000 | 183 | 491 | 724 | 853 | 929 |
10000 | 583 | 1672 | 3617 | 5808 | 7385 |
100000 | 2157 | 10483 | 26029 | 44038 | 61722 |
1000000 | 3119 | 16879 | 46739 | 80785 | 113755 |
Both the problem and the results are much similar to the previous subsection: We consider letter-N-grams at the end of words instead of the beginning.
3.8.1 Number of letter-N-grams at word beginnings